Relating modularity maximization and stochastic block models in multilayer networks
نویسندگان
چکیده
Characterizing large-scale organization in networks, including multilayer networks, is one of the most prominent topics in network science and is important for many applications. One type of mesoscale feature is community structure, in which sets of nodes are densely connected internally but sparsely connected to other dense sets of nodes. Two of the most popular approaches for community detection are maximizing an objective function called “modularity” and performing statistical inference using stochastic block models. Generalizing work by Newman on monolayer networks [1], we show in multilayer networks that maximizing modularity is equivalent, under certain conditions, to maximizing the posterior probability of community assignments under suitably chosen stochastic block models. We derive versions of this equivalence for various types of multilayer structure, including temporal, multiplex, and multilevel (i.e., hierarchical) networks. We consider cases in which the key parameters are constant as well as ones in which they vary across layers; in the latter case, this yields a novel, layer-weighted version of the modularity function. Our results also help address a longstanding difficulty of multilayer modularity-maximization algorithms, which require the specification of two sets of tuning parameters that have been difficult to choose in practice. We show how to perform this parameter selection in a statistically-grounded way, and we demonstrate the effectiveness of our approach on several multilayer network benchmarks. Finally, we apply our method to empirical data to identify multilayer communities without having to tune parameters.
منابع مشابه
Convexified Modularity Maximization for Degree-corrected Stochastic Block Models
The stochastic block model (SBM) is a popular framework for studying community detection in networks. This model is limited by the assumption that all nodes in the same community are statistically equivalent and have equal expected degrees. The degree-corrected stochastic block model (DCSBM) is a natural extension of SBM that allows for degree heterogeneity within communities. This paper propos...
متن کاملCommunity detection in temporal multilayer networks, and its application to correlation networks
Networks are a convenient way to represent complex systems of interacting entities. Many networks contain “communities” of nodes that are more densely connected to each other than to nodes in the rest of the network. In this paper, we investigate the detection of communities in temporal networks represented as multilayer networks. As a focal example, we study time-dependent financialasset corre...
متن کاملCommunity Detection in Temporal Multilayer Networks, with an Application to Correlation Networks
Networks are a convenient way to represent complex systems of interacting entities. Many networks contain “communities” of nodes that are more densely connected to each other than to nodes in the rest of the network. In this paper, we investigate the detection of communities in temporal networks represented as multilayer networks. As a focal example, we study time-dependent financial-asset corr...
متن کاملA Sparse Completely Positive Relaxation of the Modularity Maximization for Community Detection
Abstract. In this paper, we consider the community detection problem under either the stochastic block model (SBM) assumption or the degree-correlated stochastic block model (DCSBM) assumption. The modularity maximization formulation for the community detection problem is NP-hard in general. In this paper, we propose a sparse and low-rank completely positive relaxation for the modularity maximi...
متن کاملCommunity detection in networks: Modularity optimization and maximum likelihood are equivalent
We demonstrate an exact equivalence between two widely used methods of community detection in networks, the method of modularity maximization in its generalized form which incorporates a resolution parameter controlling the size of the communities discovered, and the method of maximum likelihood applied to the special case of the stochastic block model known as the planted partition model, in w...
متن کامل